Language Identification through Parallel Phone Recognition

نویسندگان

C. S. Chou

M. A. Zissman

C. S. CHOU

چکیده

Language identification systems that employ acoustic likelihoods from languagedependent phoneme recognizers to perform language classification have been shown to yield high performance on clean speech. In this report, such a method was applied to language identification of telephone speech. Phoneme recognizers were developed for English, German, Japanese, Mandarin, and Spanish using hidden Markov models. Each of these processed the input speech and output a phoneme sequence in their respective languages along with a likelihood score. The language of the incoming speech was hypothesized as the language of the model having the highest likelihood. The main differences between this system and those developed in the past are that this system processed telephone speech, could identify up to five languages, and used phonetic transcriptions to train the language-specific models. The five-language, forced-choice recognition rate on 45-s utterances was 71.9%. On 10-s utterances the recognition decreased to 70.3%. In addition, it was found that adding word-specific phonemes to the training set had a negligible effect on language identification results. in j Aoosssion For i ms QRAkl öJr ifoar-rvouijoed Q Ju s t '■. '■: 1 o a t i c n ÄjT•TH s'(y'< hi^t.l m ZXOV.tlO-Qi $ ■/''vail and/or Speoial

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Language Identification of Telephone Speech

II Lincoln Laboratory has investigated the development of a system that can automatically identify the language of a speech utterance. To perform the task of automatic language identification, we have experimented with four approaches: Gaussian mixture model classification; single-language phone recognition followed by language modeling (PRLM); parallel PRLM, which uses multiple single-language...

متن کامل

Comparison of four approaches to automatic language identification of telephone speech

AbstructWe have compared the performance of four approaches for automatic language identification of speech utterances: Gaussian mixture model (GMM) classification; single-language phone recognition followed by languagedependent, interpolated n-gram language modeling (PRLM); parallel PRLM, which uses multiple single-language phone recognizers, each trained in a different language; and languaged...

متن کامل

Language identification using parallel sub-word recognition - an ergodic HMM equivalence

Recently, we have proposed a parallel sub-word recognition (PSWR) system for language identification (LID) in a framework similar to the parallel phone recognition (PPR) approach in the literature, but without requiring phonetic labeling of the speech data in any of the languages in the LID task. In this paper, we show the theoretical equivalence of PSWR and ergodicHMM (E-HMM) based LID. Here, ...

متن کامل

Parallel Acoustic Model Adaptation for Improving Phonotactic Language Recognition

In phonotactic language recognition systems, the use of acoustic model adaptation prior to phone lattice decoding has been proposed to deal with the mismatch between training and test conditions. In this paper, a novel approach using diversified phonotactic features from parallel acoustic model adaptation is proposed. Specifically, the parallel model adaptation involves independent mean-only an...

متن کامل

Fusion of contrastive acoustic models for parallel phonotactic spoken language identification

This paper investigates combining contrastive acoustic models for parallel phonotactic language identification systems. PRLM, a typical phonotactic system, uses a phone recogniser to extract phonotactic information from the speech data. Combining multiple PRLM systems together forms a Parallel PRLM (PPRLM) system. A standard PPRLM system utilises multiple phone recognisers trained on different ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 1998

Language Identification through Parallel Phone Recognition

نویسندگان

چکیده

منابع مشابه

Automatic Language Identification of Telephone Speech

Comparison of four approaches to automatic language identification of telephone speech

Language identification using parallel sub-word recognition - an ergodic HMM equivalence

Parallel Acoustic Model Adaptation for Improving Phonotactic Language Recognition

Fusion of contrastive acoustic models for parallel phonotactic spoken language identification

عنوان ژورنال:

اشتراک گذاری